Manual and Automatic Transcriptions in Dementia Detection from Speech
نویسندگان
چکیده
As the population in developed countries is aging, larger numbers of people are at risk of developing dementia. In the near future there will be a need for timeand cost-efficient screening methods. Speech can be recorded and analyzed in this manner, and as speech and language are affected early on in the course of dementia, automatic speech processing can provide valuable support for such screening methods. We present two pipelines of feature extraction for dementia detection: the manual pipeline uses manual transcriptions while the fully automatic pipeline uses transcriptions created by automatic speech recognition (ASR). The acoustic and linguistic features that we extract need no language specific tools other than the ASR system. Using these two different feature extraction pipelines we automatically detect dementia. Our results show that the ASR system’s transcription quality is a good single feature and that the features extracted from automatic transcriptions perform similar or slightly better than the features extracted from the manual transcriptions.
منابع مشابه
Comparison of forced-alignment speech recognition and humans for generating reference VAD
This present paper aims to answer the question whether forced-alignment speech recognition can be used as an alternative to humans in generating reference Voice Activity Detection (VAD) transcriptions. An investigation of the level of agreement between automatic/manual VAD transcriptions and the reference ones produced by a human expert was carried out. Thereafter, statistical analysis was empl...
متن کاملApplication-oriented validation o preliminary r
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually validat...
متن کاملDepression Detection Using Automatic Transcriptions of De-Identified Speech
Depression is a mood disorder that is usually addressed by outpatient treatments in order to favour patient’s inclusion in society. This leads to a need for novel automatic tools exploiting speech processing approaches that can help to monitor the emotional state of patients via telephone or the Internet. However, the transmission, processing and subsequent storage of such sensitive data raises...
متن کاملA pplication-orien ted validation o f phonetic transcriptions: prelim inary results
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recogni tion technology. Such automatic phonetic transcrip tions are usually val...
متن کاملTitle : Automatic Phonetic Transcription of Large Speech Corpora
Most large speech corpora are delivered with a lexicon that contains a canonical transcription of every word in the orthographic transcription. Such a lexicon can be used for generating a hypothetical ‘canonical’ phonetic transcription from the orthography. In addition, time and money permitting, some speech corpora are provided with a manually verified broad phonetic transcription of at least ...
متن کامل